
    MINDWALC: mining interpretable, discriminative walks for classification of nodes in a knowledge graph

    Background: Leveraging graphs for machine learning tasks can result in more expressive power, as extra information is added to the data by explicitly encoding relations between entities. Knowledge graphs are multi-relational, directed graph representations of domain knowledge. Recently, deep learning-based techniques that either process these types of graphs directly or learn a low-dimensional numerical representation of them have gained a lot of popularity. While it has been shown empirically that these techniques achieve excellent predictive performance, they lack interpretability. This is of vital importance in applications situated in critical domains, such as health care.
    Methods: We present a technique that mines interpretable walks from knowledge graphs that are highly informative for a given classification problem. The walks follow a specific format that allows the creation of data structures enabling very efficient mining. We combine this mining algorithm with three different approaches to classify nodes within a graph. Each of these approaches excels on different dimensions, such as explainability, predictive performance and computational runtime.
    Results: We compare our techniques to well-known state-of-the-art black-box alternatives on four benchmark knowledge graph data sets. The results show that our three approaches, combined with the proposed mining algorithm, are at least competitive with the black-box alternatives and often outperform them, while remaining interpretable.
    Conclusions: Mining walks is an attractive alternative for node classification in knowledge graphs. In contrast to the current state of the art, which relies on deep learning techniques, it yields inherently interpretable or transparent models without sacrificing predictive performance.
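    The walk-mining idea can be illustrated with a small sketch. The Python snippet below is not the authors' implementation; it assumes walks characterised by a (vertex, depth) pair, meaning "the instance's root node reaches this vertex in exactly this many hops", and scores candidate walks by information gain over a networkx graph. The graph, labels and maximum depth are placeholders.

```python
# Minimal, illustrative sketch of mining a discriminative (vertex, depth) walk.
# Not the MINDWALC implementation; graph, labels and max_depth are assumptions.
from collections import defaultdict
from math import log2

import networkx as nx


def walk_features(graph: nx.DiGraph, root, max_depth: int) -> set:
    """Return all (vertex, depth) pairs reachable from `root` within max_depth hops."""
    features, frontier = set(), {root}
    for depth in range(1, max_depth + 1):
        frontier = {nbr for node in frontier for nbr in graph.successors(node)}
        features.update((v, depth) for v in frontier)
    return features


def entropy(labels) -> float:
    counts = defaultdict(int)
    for y in labels:
        counts[y] += 1
    total = len(labels)
    return -sum(c / total * log2(c / total) for c in counts.values())


def best_walk(graph, roots, labels, max_depth=4):
    """Pick the (vertex, depth) walk with the highest information gain."""
    feats = [walk_features(graph, r, max_depth) for r in roots]
    candidates = set().union(*feats)
    base = entropy(labels)
    best, best_gain = None, -1.0
    for cand in candidates:
        left = [y for f, y in zip(feats, labels) if cand in f]
        right = [y for f, y in zip(feats, labels) if cand not in f]
        if not left or not right:
            continue
        gain = base - (len(left) * entropy(left) + len(right) * entropy(right)) / len(labels)
        if gain > best_gain:
            best, best_gain = cand, gain
    return best, best_gain
```

    Walks mined in this way could then serve as binary features for any interpretable classifier, in line with the three classification approaches mentioned in the abstract.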

    A generalized matrix profile framework with support for contextual series analysis

    The Matrix Profile is a state-of-the-art time series analysis technique that can be used for motif discovery, anomaly detection, segmentation and other tasks, in various domains such as healthcare, robotics and audio. Whereas recent techniques use the Matrix Profile as a preprocessing or modeling step, we believe there is unexplored potential in generalizing the approach. We derived a framework that focuses on the implicit distance matrix calculation and present it as the Series Distance Matrix (SDM). In this framework, distance measures (SDM-generators) and distance processors (SDM-consumers) can be freely combined, allowing for more flexibility and easier experimentation. In SDM, the Matrix Profile is but one specific configuration. We also introduce the Contextual Matrix Profile (CMP) as a new SDM-consumer capable of discovering repeating patterns. The CMP provides intuitive visualizations for data analysis and can find anomalies that are not discords. We demonstrate this using two real-world cases. The CMP is the first of a wide variety of new series analysis techniques that fit within SDM and can complement the Matrix Profile.
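    The generator/consumer split can be made concrete with a deliberately naive sketch: compute the full z-normalized distance matrix once (the generator) and derive both the classic Matrix Profile and a context-block minimum (a CMP-like consumer) from it. This is an O(n^2) illustration under an assumed window length and assumed context boundaries, not the SDM library itself.

```python
# Naive sketch of the SDM idea: one distance-matrix "generator", two "consumers".
# Illustration only; window length m and the context boundaries are assumptions.
import numpy as np


def znorm(x: np.ndarray) -> np.ndarray:
    std = x.std()
    return (x - x.mean()) / std if std > 0 else x - x.mean()


def distance_matrix(series: np.ndarray, m: int) -> np.ndarray:
    """Generator: all-pairs z-normalized Euclidean distances between length-m subsequences."""
    subs = np.array([znorm(series[i:i + m]) for i in range(len(series) - m + 1)])
    diff = subs[:, None, :] - subs[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))


def matrix_profile(dm: np.ndarray, m: int) -> np.ndarray:
    """Consumer 1: column-wise minimum, ignoring trivial (self-)matches near the diagonal."""
    dm = dm.copy()
    n = dm.shape[0]
    for i in range(n):
        lo, hi = max(0, i - m // 2), min(n, i + m // 2 + 1)
        dm[i, lo:hi] = np.inf
    return dm.min(axis=0)


def contextual_profile(dm: np.ndarray, contexts) -> np.ndarray:
    """Consumer 2: minimum distance between every pair of context blocks (CMP-like)."""
    k = len(contexts)
    cmp_ = np.full((k, k), np.inf)
    for a, (s0, e0) in enumerate(contexts):
        for b, (s1, e1) in enumerate(contexts):
            cmp_[a, b] = dm[s0:e0, s1:e1].min()
    return cmp_
```

    Because both consumers read from the same distance matrix, swapping the distance measure or adding another consumer does not require touching the rest of the pipeline, which is the flexibility the framework aims for.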

    A dynamic dashboarding application for fleet monitoring using semantic web of things technologies

    In industry, dashboards are often used to monitor fleets of assets, such as trains, machines or buildings. In such industrial fleets, the large set of sensors evolves continuously, new sensor data exchange protocols and data formats are introduced, new visualization types may need to be added, and existing dashboard visualizations may need to be updated in terms of the sensors they display. These requirements motivate the development of dynamic dashboarding applications which, as opposed to fixed-structure dashboards, allow users to create visualizations at will and do not have hard-coded sensor bindings. The state of the art in dynamic dashboarding does not cope well with the frequent addition and removal of sensors that must be monitored: these changes must still be configured in the implementation or at runtime by a user. Moreover, the user is presented with an overload of sensors, aggregations and visualizations to select from, which may even lead to the creation of dashboard widgets that do not make sense. In this paper, we present a dynamic dashboard that overcomes these problems. Sensors, visualizations and aggregations can be discovered automatically, since they are provided as RESTful Web Things on a Web Thing Model compliant gateway. The gateway also provides semantic annotations of the Web Things, describing their abilities. A semantic reasoner can derive visualization suggestions from the Thing annotations, logic rules and a custom dashboard ontology. The resulting dashboarding application automatically presents the available sensors, visualizations and aggregations that can be used, without requiring sensor configuration, and assists the user in building dashboards that make sense. This way, the user can concentrate on interpreting the sensor data and on detecting and solving operational problems early.
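    The matching step that keeps widget suggestions sensible can be sketched with a toy example. The Turtle vocabulary and SPARQL query below are invented for illustration; the actual system relies on the Web Thing Model, a custom dashboard ontology and a semantic reasoner with logic rules rather than a single hand-written query.

```python
# Hedged sketch: suggest only visualizations whose declared capability matches
# what a sensor produces. The ex: vocabulary and the annotations are invented.
from rdflib import Graph

ANNOTATIONS = """
@prefix ex: <http://example.org/dashboard#> .

ex:axleTempSensor  a ex:Sensor ;        ex:producesQuantity ex:Temperature .
ex:lineChartWidget a ex:Visualization ; ex:visualises       ex:Temperature .
ex:gaugeWidget     a ex:Visualization ; ex:visualises       ex:Pressure .
"""

QUERY = """
PREFIX ex: <http://example.org/dashboard#>
SELECT ?sensor ?widget WHERE {
    ?sensor a ex:Sensor ;        ex:producesQuantity ?quantity .
    ?widget a ex:Visualization ; ex:visualises       ?quantity .
}
"""

graph = Graph()
graph.parse(data=ANNOTATIONS, format="turtle")

# Only the line chart is suggested for the temperature sensor; the gauge widget,
# which visualises a different quantity, is never offered for it.
for sensor, widget in graph.query(QUERY):
    print(f"suggest {widget} for {sensor}")
```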

    FLAGS: a methodology for adaptive anomaly detection and root cause analysis on sensor data streams by fusing expert knowledge with machine learning

    Anomalies and faults can be detected, and their causes verified, using both data-driven and knowledge-driven techniques. Data-driven techniques can adapt their internal functioning based on the raw input data, but fail to explain the manifestation of any detection. Knowledge-driven techniques inherently deliver the cause of the faults they detect, but require too much human effort to set up. In this paper, we introduce FLAGS, the Fused-AI interpretabLe Anomaly Generation System, which combines both techniques in one methodology to overcome their limitations and optimizes them based on limited user feedback. Semantic knowledge is incorporated into a machine learning technique to enhance its expressivity. At the same time, feedback about the faults and anomalies that occurred is provided as input to increase adaptiveness, using semantic rule mining methods. The new methodology is evaluated on a predictive maintenance case for trains. We show that our method reduces train downtime and provides more insight into frequently occurring problems.
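    The fusion idea, detection by a data-driven model and explanation by expert rules, can be sketched as follows. This is a minimal illustration in the spirit of the methodology, not the FLAGS system itself; the sensor names, thresholds, rules and toy data are invented, and the feedback loop is omitted.

```python
# Minimal sketch of fusing a data-driven detector with knowledge-driven rules.
# Not the FLAGS implementation; sensors, thresholds and rules are assumptions.
import numpy as np
from sklearn.ensemble import IsolationForest

rng = np.random.default_rng(0)
# Toy train-sensor readings: [axle_temperature_degC, vibration_rms]
normal = rng.normal(loc=[60.0, 0.3], scale=[3.0, 0.05], size=(500, 2))
faults = np.array([[95.0, 0.32], [62.0, 1.10]])   # overheated axle, worn bearing
readings = np.vstack([normal, faults])

# Data-driven part: adapts to the raw data but cannot explain what it finds.
detector = IsolationForest(contamination=0.01, random_state=0).fit(normal)
is_anomaly = detector.predict(readings) == -1

# Knowledge-driven part: expert rules that attach a probable cause to a detection.
RULES = [
    (lambda x: x[0] > 85.0, "axle overheating (temperature above 85 degrees C)"),
    (lambda x: x[1] > 0.8,  "excessive vibration, possible bearing wear"),
]

def explain(sample: np.ndarray) -> str:
    causes = [cause for rule, cause in RULES if rule(sample)]
    return "; ".join(causes) if causes else "unknown cause, needs expert feedback"

for sample, flagged in zip(readings, is_anomaly):
    if flagged:
        print(f"anomaly {sample} -> {explain(sample)}")
```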

    Deep learning models for predicting RNA degradation via dual crowdsourcing

    Medicines based on messenger RNA (mRNA) hold immense potential, as evidenced by their rapid deployment as COVID-19 vaccines. However, worldwide distribution of mRNA molecules has been limited by their thermostability, which is fundamentally constrained by the intrinsic susceptibility of RNA molecules to a chemical degradation reaction called in-line hydrolysis. Predicting the degradation of an RNA molecule is therefore a key task in designing more stable RNA-based therapeutics. Here, we describe a crowdsourced machine learning competition (‘Stanford OpenVaccine’) on Kaggle, involving single-nucleotide-resolution measurements on 6,043 diverse 102–130-nucleotide RNA constructs that were themselves solicited through crowdsourcing on the RNA design platform Eterna. The entire experiment was completed in less than 6 months, and 41% of nucleotide-level predictions from the winning model were within experimental error of the ground-truth measurement. Furthermore, these models generalized to blindly predicting orthogonal degradation data on much longer mRNA molecules (504–1,588 nucleotides) with improved accuracy compared with previously published models. These results indicate that such models can represent in-line hydrolysis with excellent accuracy, supporting their use in designing stabilized messenger RNAs. The integration of two crowdsourcing platforms, one for dataset creation and another for machine learning, may be fruitful for other urgent problems that demand scientific discovery on rapid timescales.
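    To make the prediction task concrete: each model maps an RNA sequence to a degradation value per nucleotide. The sketch below shows only that task shape with a small bidirectional GRU on random data; the architecture, shapes and targets are assumptions for illustration and bear no relation to the competition's winning models.

```python
# Bare-bones sketch of per-nucleotide degradation regression from sequence.
# Illustration of the task shape only; architecture and data are assumptions.
import torch
import torch.nn as nn

VOCAB = {"A": 0, "C": 1, "G": 2, "U": 3}

class DegradationModel(nn.Module):
    def __init__(self, emb_dim=32, hidden=64, n_targets=1):
        super().__init__()
        self.embed = nn.Embedding(len(VOCAB), emb_dim)
        self.rnn = nn.GRU(emb_dim, hidden, batch_first=True, bidirectional=True)
        self.head = nn.Linear(2 * hidden, n_targets)

    def forward(self, tokens):               # tokens: (batch, seq_len) int64
        h, _ = self.rnn(self.embed(tokens))  # (batch, seq_len, 2 * hidden)
        return self.head(h)                  # one prediction per nucleotide

# Toy batch: 8 random sequences of 107 nucleotides with fake degradation targets.
tokens = torch.randint(0, 4, (8, 107))
targets = torch.rand(8, 107, 1)

model = DegradationModel()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
loss = nn.MSELoss()(model(tokens), targets)
loss.backward()
optimizer.step()
print(float(loss))
```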